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Abstract. We investigate the length distribution of self-assembled, long and stiff 
Q , polymers at thermal equilibrium. Our analysis is based on calculating the partition 

^ ' functions of stiff polymers of variable lengths in the elastic regime. Our conclusion is 

that the length distribution of this self-assembled system follows closely the exponential 
distribution, except at the short length limit. We then discuss the implications of our 
results on the experimentally observed length distributions in amyloid fibrils. 
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1. Introduction 

Biopolymers, such as microtubules, actin filaments and amyloid fibrils, differ from 
synthetic polymers in that the persistence lengths are usually greater than the typical 
lengths of the biopolymers in the system. For instance, the persistence lengths, l p , of 
microtubules, actin filaments, sickle cell hemoglobin fibers and amyloid fibrils have been 
estimated to be in the order of 10 2 — 10 3 /imlil [lj, 16 //m [2], 10 3 fim [3] and 1 — 20 fim 
[H El E] , respectively. Another distinguishing feature of biopolymers from their synthetic 
counterparts is their relatively low polymerisation binding energies. For example, the 
binding energy for some amyloid fibrils have been estimated to be in the order of —10 
kcal/mol [8]. This relatively low binding energy implies that thermal equilibrium 
may be reachable within experimentally relevant time scale. At thermal equilibrium, 
the polymerisation process will reach a steady state and the length distribution of the 
biopolymers constitutes one of the defining characteristics of the system. In particular, 
the knowledge of the length distribution provides an estimate on the binding energy, and 
thus also the stability, of the biopolymer concerned. In the dilute regime where pairwise 
interactions between polymers can be ignored, mean-field theory predicts that the length 
distribution at thermal equilibrium is exponentially distributed j9j \10\ . Specifically, if 
4>l denotes the concentration of polymers of length L, then mean-field theory indicates 
that 

4>l oc e" L / Q (1) 

where a = (L) is the average length. In the mean-field approximation, each monomer 
within the polymer is considered identical. This is a coarse approximation as the 
contribution of each monomer to the partition function can potentially be depend on 
its position within the polymer. For instance, for long and flexible polymers, the above 
formula is modified due to the intra-chain volume exclusion interactions [HI [12] . Is the 
mean- field prediction also modified for stiff polymers in the regime l p > (L)? This is 
the question asked in this paper. Our analysis is based on mapping the calculations 
of the partition functions for stiff polymers to a quantum mechanical problem through 
the path integral formalism. Although this mapping has been employed extensively in 
the literature (e.g., see [T3J [HI [151 EE to the best of the authour's knowledge, 
this technique has never been employed to investigate the length distribution of self- 
assembled polymers. 

Besides the intrinsic importance in understanding the thermodynamic properties, 
this work is also motivated by the recent experiments on amyloid fibrillisation in which 
a variety of length distributions of amyloid fibrils formed from different proteins are 
observed [HI [191 EH EH [22]. We will comment on the implications of our results on 
these experimental observations in Section 3. 

| Note that in the case of micotubules, the persistence length has been shown to demonstrate to be 
dependent on the total length of the biopolymers in [I] 
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2. Model 

To calculate the length distribution, we minimize the free energy of the system with 
respect to the length distribution within the saddle-point approximation. In the dilute 
limit where interactions between polymers are negligible, we can write the total partition 
function as follows [23J: 

' (7.\ N * 

s iv «- 

where N s is the number of s-mers in the system. The prime in the product denotes the 
constraint 

oo 

Y,sN s = N (3) 

s=l 

where N being the total number of monomers. In equation (J2J), Z s denotes the partition 
that corresponds to a s-mer. Specifically, 

Z ' = ^/r, dXl '" dX ' e " , ' <W,/ " T - (4) 
where H{x} is the internal energy corresponding to the configuration {x}. The domain 
of integration, r s , is constrained in such a way that the configuration domain represents 
a s-mer. In equation we have also integrated out the kinetic part of the partition 
function and so A = hj \j2ixmkBT corresponds to the thermal wavelength of the 
monomer [23]. Since there are (s!) ways of arranging the monomers within the s-mer, 
it cancels with the factor (s!) in the denominator in equation (J3J) once the enumeration 
of the monomers within the polymer is fixed. To ease notation, we will also set ksT to 
one from now on. 

Given the total partition in equation (j2J), one can obtain the size distribution in 
thermal equilibrium by minimising the total free energy F = — ksT In Z tot with the 
constraint in equation ([3]) enforced by the Lagrange multiplier method. The resulting 
distribution is then of the form 

S cx Z s e- sX (5) 

where <p s is the concentration of s-mers in the system, and A is the Lagrange multiplier 
that enforces the conservation of the total monomer number [TH] El]- Let L s be the 
length of the s-mer, we will from now on set the unit of length to L s /s. As a result, <p s 
is equivalent to the length distribution in the system. 

Within the mean-field approximation, the integration in equation (j3J) is 
approximated as a product of s identical integrals and so we have Z s = VKA S for some 
constants A and K, and V corresponds to the total volume of the system. From equation 
05]), the mean-field approximation therefore predicts that the length distribution is 

S oc e - s{x - lnA) . (6) 

Namely, the length distribution is exponential, a well known mean-field results 
[To] EHl [24]. 
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Figure 1. The length distributions of self-assembled stiff polymers in thermal 
equilibrium according to the mean-field theory (c.f. equation ^) and the corrected 
version computed in this work (c.f. equation (1151) ). The average aggregation numbers, 
(s), are fixed to 20 in both cases. For the broken curve, K = 1 and e = 10. For 
parameters relevant to amyloid fibrillisation such that n, e, (s) 3> 1, the mean-field and 
the corrected results will be completely indistinguishable. 



We will now go beyond the mean-field approximation. To do so, one needs to 
account for the differential contributions to the s-mer partition function from different 
configurations of the s-mer. For long and stiff polymers, one can describe the energy of 
the polymer by its elastic constants for bending (e) and for stretching (k) j26j [131 127] - 
This picture entails the assumptions that the elastic constants are uniform throughout 
the polymers and are length independent. Based on this continuum approach, the 
partition function for a stiff polymer of length s is [261 H31 127] 



where D[x\ = IlfcXfc an d K is some constant that does not depend on s and will thus 
be ignored from now on. In equation ([7]), 



The first term in the above Hamiltonian is responsible for the self-assembly process, 
where fi < accounts for the binding energy per monomer, and 7 C s corresponds 
to the missing binding energy at the two ends of the polymer [Till 124"] . Furthermore, t 
denotes the contour length of the polymer, whose configuration is given by x(t). We 
also have u(t) = <9x/<9t and = |u(£)|. Note that the bending energy should strictly 
be dependent on the stretching as well, but the current form of the energy is valid for 
small elongation and contraction |27J. 




(7) 
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The integral in equation (|7j) can be seen as a path integral in the quantum mechanics 
setting [13j El- Specifically, if we fix the tangent vectors at the end points of the 
polymer in the configuration integral in equation (JJJ), then the path integral describes 
the evolution of the wavefunction, ip s (t,u), that satisfies the following Schrodinger-like 
equation: 



dips 
dt 



^V?ik-£(£-l)V. 



2e 
1 

Ye 



(9) 
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where u is expressed in terms of the polar coordinates (£, 0, 
for the above differential equation is known [T3] : 



e sin 2 9 



The Green's function 



G(a,Ml*',£',0',0')= E ex P 



X 



2e 



Rnp(ORnp(t,') 



(10) 
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The F nm are the normalized spherical harmonics defined as: 



(2n + l)(n-m)! 
^2vr(l + 5 0m )(n + m)! " l 



cosmi; 
sin mtf 



where are the Legendre polynomials. Furthermore, in equation ( fTU|) . P np are defined 
by the following eigenvalue equation with eigenvalue X np : 



1 d 2 



n(n + 1] 



P 



rip 



<^-l) 2 -A 
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up 



(12) 



Now, the s-mer partition function can be computed by integrating over all the 
possible choices for the two end points in the above Green's function. As a result, 

Z s = e-( s -~^V J dflctydf dfl'cty'df '£ 2 f 72 sin 6 sin 6'G{L S , f , 9, <f>\0, £, 6', 0') 

rn (2p + l)s~ 



ine-^-^VY^Cpexp 

p=0 



(13) 



where the expression in equation (fT3|) is valid in the limit eK 3> 1 [GO , and C p are defined 
as [THEE] 

2p\ 

(14) 



(^)*7*2P[(p/2)!P 



p even 
p odd . 



ne 2P[(p-l)/2]! ' 

Employing equation (jSJ), the length distribution is then 



oc e 



-s/ot 



1 + 



ne\ 
7 



1/4 



7T 



p>i 



(15) 



Due to the series summation in the square brackets, the constant a, set by the total 
concentration of the monomers in the system, no longer corresponds to the average 
length of the polymers. Furthermore, since C p > for all p, the effects of the bending 
and stretching of the polymers are to make the length distribution non-exponential (c.f. 
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fig. d]). In particular, in comparison to the mean-field prediction, the standard deviation 
in the length distribution of the polymers is decreased. 

Indeed, the summation in the square brackets constitutes the correction to the 
mean-field prediction. Physically, the correction terms stem from the ability of a short 
polymer to sense the boundary effects due to the rigidity (the term proportional to e in 
equation |8]) . Intuitively, this effect will disappear as the length increases, which is the 
case since the terms in the series are rapidly decreasing due to the form of C p and the 
exponential terms. In other words, the length distribution should be well described by 
the exponential distribution when the average length is much greater than 1. 

3. Relevance to amyloid fibrils 

In the case of self-assembled amyloid fibrils, experimental measurements on the length 
distribution has been carried by various groups [T8(, HHJ [201 EH E2] • A typical amyloid 
fibril has a persistence length in the order of 1 — 10 /iin and has an average length in the 
order of 1 /im in typical experimental conditions [H El El [29] . Various amyloid fibrils have 
also been shown to share similar mechanical properties [29]. In particular, the Young's 
modulus and bending rigidity for the insulin amyloid fibrils were found to be in the order 
of 3 GPa and 10 -25 Nm 2 respectively [1]. In our non-dimensionalised units, where k^T 
is set to one and the unit of length corresponds to the average fibril length per insulin 
protein, which is in the order 1 nm [3j, one finds that k ~ e ~ 10 4 . Therefore, we expect 
that our analysis performed here is appropriate for the study of amyloid fibrillisation. 
Specifically, we expect that the fibrillar length distribution at thermal equilibrium is well 
described by the exponential form. Experimentally, all measured length distributions 
seem to show broad distribution and exponential-like decay in the tail. This correlates 
well with the prediction. On the other hand, contrary to our prediction, peaks in the 
length distributions have also been observed in some cases [20j [2Tj. van Raaij et al. has 
interpreted the observed peaks as a result of the finite resolution of the atomic force 
microscopy imaging and length measurement procedure [20J. Besides this explanation, 
it is also known that it can take in the order of months for mature fibrils to form [30J. 
Therefore, the appearance of the peaks observed may also reflect the fact that the self- 
assembled systems have not yet reached thermal equilibrium due to the kinetic barrier 
in the nucleation process [2H EU E21 133] . 

4. Conclusion 

In summary, we have studied length distribution of self-assembled, long and stiff 
polymers. Our analysis is based on the calculation of the total free energy of the 
system and our conclusion is that at thermal equilibrium, the length distribution is well 
described by the exponential distribution, except in the short length limit. We have 
also discussed the implications of our results on the experimentally observed length 
distributions in amyloid fibrils. 
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The two main limitations of our investigations are the followings. First, our 
analysis applies only to self-assembled stiff polymers at thermal equilibrium. Indeed, for 
nonequilibrium systems, non-exponential length distribution can occur [311 EH]- Second, 
our approach applies strictly to the dilute limit where pairwise polymer interactions 
are negligible. As the concentration of polymers increases, the pairwise interactions 
will become important. If the interactions are purely steric, then an isotropic- nematic 
transition will occur as concentration increases. In this case, according to mean-field 
theory, the length distribution of the system will remain exponential although the 
average length will increase drastically as the transition occurs [361 S3 IB]- Since our 
analysis suggests that mean-field theory is a good approximation for self-assembled 
stiff polymers in the dilute limit, it may be reasonable to expect that the mean-field 
prediction will continue to hold in this scenario as well. 
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